Markov decision process

Results: 537



# Item
291 Stochastic control / Partially observable Markov decision process / Markov decision process / Automated planning and scheduling / Bayesian network / FO / S0 / Finite-state machine / Macro / Statistics / Dynamic programming / Markov processes

Exploiting Fully Observable and Deterministic Structures in Goal POMDPs

Source URL: www.ida.liu.se

Language: English - Date: 2013-08-29 10:13:42
292 Markov processes / Dynamic programming / Markov decision process / Stochastic control / Distribution / Multi-armed bandit / Statistics / Mathematical analysis / Generalized functions

Selecting the State-Representation in Reinforcement Learning Odalric-Ambrym Maillard INRIA Lille - Nord Europe [removed]

Source URL: eprints.pascal-network.org

Language: English - Date: 2011-11-02 05:20:38
293 Artificial intelligence / Reinforcement learning / Q-learning / Feature selection / Prior probability / Action selection / Markov decision process / One-shot learning / Golden ratio base / Statistics / Machine learning / Probability and statistics

Feature Selection for Domain Knowledge Representation through Multitask Learning Benjamin Rosman Mobile Intelligent Autonomous Systems CSIR, South Africa [removed]

Source URL: www.benjaminrosman.com

Language: English - Date: 2014-09-24 09:42:41
294 Stochastic control / Human–computer interaction / Control theory / Partially observable Markov decision process / Dialog system / Automated planning and scheduling / Markov decision process / Dialog / Speech recognition / Statistics / Dynamic programming / Markov processes

Partially Observable Markov Decision Processes for Spoken Dialog Systems Jason D. Williams, Steve Young

Source URL: mi.eng.cam.ac.uk

Language: English - Date: 2013-10-14 15:42:14
295 Dynamic programming / Stochastic control / Markov models / Expectation–maximization algorithm / Partially observable Markov decision process / Maximum likelihood / Reinforcement learning / Markov chain / Normal distribution / Statistics / Markov processes / Estimation theory

Natural Belief-Critic: a reinforcement algorithm for parameter estimation in statistical spoken dialogue systems F. Jurčíček, B. Thomson, S. Keizer, F. Mairesse, M. Gašić, K. Yu, and S. Young Engineering Depa

Source URL: mi.eng.cam.ac.uk

Language: English - Date: 2010-11-01 08:12:32
296 Estimation theory / Expectation–maximization algorithm / Maximum likelihood / Partially observable Markov decision process / Parameter / Kullback–Leibler divergence / Normal distribution / Bayesian network / Dialogue / Statistics / Statistical theory / Bayesian statistics

PARAMETER LEARNING FOR POMDP SPOKEN DIALOGUE MODELS B. Thomson, F. Jurčíček, M. Gašić, S. Keizer, F. Mairesse, K. Yu, S. Young Cambridge University Engineering Department ABSTRACT The partially observable Mar

Source URL: mi.eng.cam.ac.uk

Language: English - Date: 2010-11-09 15:42:38
297 Stochastic control / Partially observable Markov decision process / Graphical models / Probability theory / Dialogue / Probability / Bayesian network / Markov decision process / Markov chain / Statistics / Markov processes / Dynamic programming

Available online at www.sciencedirect.com Computer Speech and Language[removed]–174 COMPUTER SPEECH AND

Source URL: mi.eng.cam.ac.uk

Language: English - Date: 2010-05-02 07:09:58
298 Partially observable Markov decision process / Stochastic control / Literature / Domain / Dialogue / Speech recognition / Protein domain / Kernel / Fiction / Statistics / Dynamic programming

POMDP-based dialogue manager adaptation to extended domains M. Gašić, C. Breslin, M. Henderson, D. Kim, M. Szummer, B. Thomson, P. Tsiakoulis and S. Young Cambridge University Engineering Department {mg436,cb404,mh52

Source URL: mi.eng.cam.ac.uk

Language: English - Date: 2013-08-27 04:45:47
299 Statistics / Mathematical optimization / Linear programming / Simplex algorithm / Markov decision process / Robust optimization / Algorithm / Operations research / Mathematics / Applied mathematics

Seminar Series 3108 Etcheverry Hall Berkeley Campus December 1, 2014 3:40pm - 5:00pm Archis Ghate

Source URL: www.ieor.berkeley.edu

Language: English - Date: 2014-11-25 16:36:59
300 Computational neuroscience / Cybernetics / Reinforcement learning / Q-learning / Temporal difference learning / SARSA / Markov decision process / Unsupervised learning / Recurrent neural network / Machine learning / Neural networks / Statistics

Playing Atari with Deep Reinforcement Learning Volodymyr Mnih Koray Kavukcuoglu

Source URL: arxiv.org

Language: English - Date: 2013-12-19 20:23:45